Abstract

The top squark (stop) may dominantly decay to a bottom quark and a chargino if the mass difference between the stop and the lightest neutralino is comparable to or less than the top quark mass. Such a moderately compressed spectrum is a challenging scenario for the stop search at the Large Hadron Collider, because it is difficult to separate the signals from the top and anti-top background. In this paper we focus on the di-leptonic decay channel, and consider many kinematic variables as possible discriminators. These include several $M_{T2}$ variables and new "compatible-masses" variables which fully utilize all kinematic information of the background. We use several sample spectra with different characteristics to study the efficiencies of these variables in distinguishing the signal from the background. The finding is that different combinations of variables or strategies should be used for different spectra to maximally enhance the signal significance and expand the reach of the stop search in this scenario. The new variables that we proposed in this paper are also useful for other new physics searches with di-leptonic top and anti-top events as the dominant background.

1 Introduction 

The hierarchy problem of the standard model (SM) has become more prominent with the discovery of a light Higgs boson at the Large Hadron Collider (LHC) [1,2]. On the other hand, no new physics which could address the hierarchy problem has shown up in the searches at the LHC so far. Supersymmetry (SUSY) is widely viewed as the most promising solution to the hierarchy problem. However, the canonical SUSY scenarios are already highly constrained by the null results of many SUSY searches at the LHC. In particular, the gluino and the first two generation squarks are constrained to be heavier than ~1 TeV if they have the standard $R$-parity conserving decays [3-6]. The third generation squarks, on the other hand, have weaker bounds. Not only are the mass reaches lower than those of the first two generation squarks, but in the low mass region there is also still plenty of parameter space that has not been experimentally excluded, due to the more complicated decay patterns and SM backgrounds. Since the third generation squarks have the largest direct couplings to the Higgs fields, they are the most relevant superpartners for the hierarchy problem. Naturalness arguments strongly prefer them to be light, independent of any particular SUSY scenario. Therefore, they have been the focus of intensive experimental and phenomenological investigations.

Studies of stop searches have been performed by many groups in various (fully hadronic, single-lepton, di-lepton) channels in recent years [7-42]. The LHC experiments have performed dedicated stop searches [43-48] but so far have not found any evidence for the stop particles. As the fine tuning in the Higgs sector increases with the stop mass, an intriguing possibility is that the stop is not very heavy, but the mass spectrum of the stop and other lighter superpartners is somewhat compressed and the event signals from stop decay do not look very different from the SM backgrounds. In that case, simple variables often used in SUSY searches, such as $E_T^{\rm miss}$ and the effective mass, are usually not very helpful in discriminating signals from backgrounds. It is therefore desirable to look for more sophisticated variables which may have more discriminating power. In this paper, we focus on such scenarios, where the mass difference between the stop and neutralino is close to or smaller than the top mass. In this case, the decay of the stop to a bottom quark and a chargino can be dominant, since the decay to a top and a neutralino is closed or phase-space suppressed. We only consider model-independent direct stop pair production by assuming all other colored superpartners are heavy. The signal event topology that we study is then a symmetric decay chain in which a pair of stops both decay through a chargino, which decays into a stable neutralino plus a $W$ gauge boson. Furthermore, we focus on the di-leptonic channel, in which both $W$ bosons (from the decay of charginos) decay to a lepton-neutrino pair.



The ATLAS collaboration at the LHC has done such a search with 13 fb$^{-1}$ of data at 8 TeV [44] (and an update with 21 fb$^{-1}$ [47] just appeared while we finalized our paper). In that analysis, an $M_{T2}$ [49,50] variable defined with the leptons is the primary variable throughout the analysis. The $M_{T2}$ of leptons has an end point around $M_W$ for the $t\bar t$ background, and in general has a strong correlation with the $p_T$'s of the leptons. Hence, it is very useful when the signal has harder leptons than the background does. However, depending on the mass spectrum, the signal does not necessarily have harder leptons, especially for the compressed scenario. It is important to explore different regions of parameter space, including those regions that would produce soft leptons (but maybe hard $b$-jets), which would in general require different strategies and kinematic variables. In this paper we choose a few representative points in the parameter space with different characteristics and try to identify the best strategy for each case.

The rest of the paper is organized as follows. In Sec. 2 we present the sample signal spectra, and describe the details of event simulations and basic selections. In Sec. 3 we define many kinematic variables which can be constructed from the kinematic information of the events. The distributions of the signals and the background in these variables are shown so that we can see which variables can be useful for different mass spectra. In Sec. 4 we perform a detailed study of these kinematic variables one by one and also in combinations to compare their performances in various scenarios. We identify the best variables and strategies for each of the sample spectra. The conclusions are drawn in Sec. 5. Some details of the definition and computation of a new set of variables are described in Appendix A.

2 Sample Spectra, Event Simulations and Basic Event Selections 

The stop particles can be produced in pairs at the LHC via the QCD interaction. If $R$-parity is conserved, they will decay into other lighter $R$-parity odd superpartners. In this paper, we assume that some of the neutralinos and charginos are lighter than the stop and focus on the challenging scenario with a relatively small mass difference between the lightest stop and the lightest neutralino: $m_W < m_{\tilde t_1} - m_{\tilde\chi^0_1} \lesssim m_t$. If the mass difference is large, the visible decay products will be very energetic and easily distinguishable from the SM backgrounds, so the search is then mostly limited by the production rate. On the other hand, if the spectrum is highly degenerate, the decay products may be too soft to pass the triggers or experimental cuts. One has to rely on the mono-jet search to place bounds in that case. Assuming a light chargino sitting in between the stop and neutralino masses, $m_{\tilde\chi^0_1} < m_{\tilde\chi^\pm_1} < m_{\tilde t_1}$, the dominant decay channel of the stop is likely to be $\tilde t_1 \to b \tilde\chi^\pm_1$, with the chargino $\tilde\chi^\pm_1$ decaying into $\tilde\chi^0_1$ plus an on-shell or off-shell $W$ gauge boson. Concentrating on the case of leptonic decays for both $W$ bosons, we have the signal final state $2b + 2\ell + E_T^{\rm miss}$. The detailed Feynman diagram for the di-leptonic stop signal is shown in the left panel of Fig. 1. The dominant background for the stop search in this channel is the $t\bar t$ production with leptonic decays in both decay chains after requiring a minimum of the missing transverse energy, as shown in the latest ATLAS analysis [44,47] (though the ATLAS analysis used a cut on an $M_{T2}$ variable instead of missing transverse energy to suppress other backgrounds). The corresponding Feynman diagram is shown in the right panel of Fig. 1. In our study, we focus on the $t\bar t$ background, and look for suitable kinematic variables which can effectively separate the signal from the $t\bar t$ background in order to improve the stop search in this channel.

Figure 1: Left: Feynman diagrams of stop pair production with both stops decaying through a chargino and with two leptons in the final state. Right: Feynman diagrams of $t\bar t$ pair production with two leptons in the final state, which is the main background after some basic cuts.

The $b$-jet and lepton momenta in the final state follow from the two mass differences: $m_{\tilde t} - m_{\tilde\chi^\pm}$ and $m_{\tilde\chi^\pm} - m_{\tilde\chi^0}$. The equivalent mass differences for the $t\bar t$ background are $m_t - M_W$ and $M_W - m_\nu$, which are fixed by the top and $W$ masses. As a consequence, we anticipate that for different spectra one may use different kinematic variables to improve the search. To illustrate this point, we study several representative spectra based on whether the $b$-jet and the lepton are harder or softer in the final state. We choose two different mass gaps between the stop and the neutralino. The first one has 300 - 120 = 180 GeV, which is close to the top quark mass. The other one has 250 - 120 = 130 GeV, a smaller mass gap than that of the top background, with generically softer leptons and $b$-jets. For each fixed mass gap between the stop and the neutralino, we study three different cases with the chargino mass close to the stop mass, close to the neutralino mass, or at a point where the mass differences resemble those that the $W$ gauge boson mass sets between the top and the neutrino masses. The six sets of masses are shown in Table 1, where we label the six different spectra from S1 to S6. We also highlight the characteristic features of the $b$-jet and lepton momenta by comparing them to the momenta in the $t\bar t$ background. We choose the mass differences such that even for softer $b$-jets or leptons, a significant fraction of them can still pass the cuts and register in the signal events; otherwise alternative search strategies will be needed. None of these six spectra is ruled out by the ATLAS search at 8 TeV with 13 fb$^{-1}$, although the spectrum S3 may have been excluded by the 21 fb$^{-1}$ update [47]. We want to emphasize that the current search strategies at ATLAS [44] using the $M_{T2}$ variable constructed from the harder leptons will only be sensitive to the S3-like spectra. For other types of spectra, different variables are generally required to distinguish the signal from the background. We will come back to this point in Section 4.

Table 1: Six representative signal spectra according to their similarities to the $t\bar t$ background. In the last two columns, the labels "softer", "harder" and "comparable" refer to the comparison of the $b$-jet or lepton momenta between the signal and the $t\bar t$ background.

Our detailed collider studies are based on the 8 TeV LHC with 22 fb$^{-1}$, which is roughly the total integrated luminosity collected by either ATLAS or CMS for the 8 TeV run. Signal and background events are generated using MadGraph5 [51], and showered in PYTHIA [52]. We use PGS [53] to perform the fast detector simulation with modified $b$-tagging efficiencies that roughly match the latest ATLAS $b$-tagging efficiency [54]. For signal events, we do not include the $\tau$ leptons from $W$ gauge boson decays, but they are kept in the background events. This is because this type of background may become important once the background with direct electrons and muons from $W$ decays is sufficiently suppressed. The signal production cross section is normalized to the value calculated at NLO+NLL [55].$^1$ We assume the stop decays to a bottom quark and a chargino with 100% branching ratio. The $t\bar t$ production cross section is normalized to be 238^24 pb, calculated at NLO+NNLL [44]. We generated 10^ events for each signal spectrum, and $5 \times 10^5$ events for the $t\bar t$ di-leptonic background (which is close to the total number of the corresponding background events of 22 fb$^{-1}$ integrated luminosity).

$^1$ https://twiki.cern.ch/twiki/bin/view/LHCPhysics/SUSYCrossSections8TeVstopsbottom
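As a rough cross-check of the background sample size, the expected number of di-leptonic $t\bar t$ events can be estimated from the cross section and luminosity quoted above. The sketch below is only illustrative: the leptonic $W$ branching fraction of about 10.8% per lepton flavor is an assumed standard value that is not stated in the text, all three lepton flavors are counted (since $\tau$'s are kept in the background), and the function name is hypothetical.

```python
# Rough estimate of the number of di-leptonic ttbar events at the 8 TeV LHC.
SIGMA_TTBAR_PB = 238.0   # central ttbar cross section quoted in the text (pb)
LUMI_FB = 22.0           # integrated luminosity used in the study (fb^-1)
BR_W_TO_LNU = 0.108      # ASSUMPTION: W -> l nu branching fraction per lepton flavor


def expected_dilepton_ttbar(sigma_pb=SIGMA_TTBAR_PB, lumi_fb=LUMI_FB, n_flavors=3):
    """Return sigma * L * BR(both W bosons decay leptonically)."""
    lumi_pb = lumi_fb * 1000.0                     # 1 fb^-1 = 1000 pb^-1
    br_dilepton = (n_flavors * BR_W_TO_LNU) ** 2   # both W's decay to any lepton flavor
    return sigma_pb * lumi_pb * br_dilepton


if __name__ == "__main__":
    print(f"expected di-leptonic ttbar events: {expected_dilepton_ttbar():.3g}")
```

Under these assumptions the estimate comes out at roughly $5.5 \times 10^5$ events before any selection, which is the order of magnitude of the generated background sample.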

For the basic cuts on the objects, we closely follow the ATLAS analysis [44]: electrons are required to have $p_T > 20$ GeV and $|\eta| < 2.47$; muons are required to have $p_T > 10$ GeV and $|\eta| < 2.4$; jets are required to have $p_T > 20$ GeV and $|\eta| < 2.5$. Any jet within $\Delta R = \sqrt{\Delta\phi^2 + \Delta\eta^2} = 0.2$ of an electron is discarded. Any electron or muon within $\Delta R = 0.4$ of any remaining jet is discarded. We require exactly two opposite-charge leptons (electrons or muons), and at least one lepton with $p_T > 25$ GeV. The invariant mass of the two leptons needs to be larger than 20 GeV. If the two leptons have the same flavor, the invariant mass is further required to be outside the 71 - 111 GeV window to cut out on-shell $Z$ backgrounds. Two additional variables, $\Delta\phi_{\min}$ and $\Delta\phi_b$, are defined in Ref. [44] as follows: $\Delta\phi_{\min}$ is the azimuthal angle difference between the $\mathbf{p}_T^{\rm miss}$ vector and the closest jet three-momentum; $\Delta\phi_b$ is the azimuthal angle difference between the $\mathbf{p}_T^{\rm miss}$ vector and the vector $\mathbf{p}_b = \mathbf{p}_T^{\rm miss} + \mathbf{p}_T^{\ell_1} + \mathbf{p}_T^{\ell_2}$. We require $\Delta\phi_{\min} > 1$ and $\Delta\phi_b < 1.5$.

The ATLAS analysis in Ref. [44] has focused on the energetic leptons from the signals and did not impose any cut on the $b$-tagged jets. In contrast, we will treat the $b$-jets on the same footing as the leptons, because they are required to construct other variables for different spectra. Specifically, we require at least two $b$-tagged jets. If there are more than two $b$-jets, which rarely happens, we simply select the leading two $b$-jets for our analysis. The $b$-jet requirement also reduces backgrounds which contain no $b$-jet. Unlike the ATLAS analysis in Ref. [44], where a lower $M_{T2}$ cut was used, we require $E_T^{\rm miss}$ to be above 40 GeV to suppress additional SM backgrounds like $Z$+jets, for which a small $E_T^{\rm miss}$ can be induced by the mis-measurement of jets.
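The overlap-removal steps in the object selection above can be sketched as follows. This is a minimal illustration, not the analysis code: objects are represented as plain dictionaries with hypothetical keys `eta` and `phi`, and the $p_T$ and $|\eta|$ acceptance cuts are assumed to have been applied beforehand.

```python
import math


def delta_phi(phi1, phi2):
    """Azimuthal separation wrapped into [0, pi]."""
    dphi = abs(phi1 - phi2) % (2.0 * math.pi)
    return 2.0 * math.pi - dphi if dphi > math.pi else dphi


def delta_r(a, b):
    """Delta R = sqrt(dphi^2 + deta^2) between two objects."""
    return math.hypot(a["eta"] - b["eta"], delta_phi(a["phi"], b["phi"]))


def remove_overlaps(electrons, muons, jets):
    """Apply the two-step overlap removal described in the text."""
    # Step 1: discard any jet within Delta R < 0.2 of an electron.
    jets = [j for j in jets if all(delta_r(j, e) >= 0.2 for e in electrons)]
    # Step 2: discard any electron or muon within Delta R < 0.4 of a remaining jet.
    electrons = [e for e in electrons if all(delta_r(e, j) >= 0.4 for j in jets)]
    muons = [m for m in muons if all(delta_r(m, j) >= 0.4 for j in jets)]
    return electrons, muons, jets
```

The order of the two steps matters: a jet that is really an electron's calorimeter deposit is removed first, so that it cannot in turn veto that electron in the second step.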

After the basic cuts, we present the numbers of signal and background events as well as the initial significances $s/\sqrt{b}$ in Table 2. To obtain the normalized events at 22 fb$^{-1}$, we adopt the following signal production cross sections at the 8 TeV LHC: 2.00 pb for $m_{\tilde t} = 300$ GeV and 5.60 pb for $m_{\tilde t} = 250$ GeV. As can be seen from Table 2, none of the six spectra can be excluded at 90% C.L. at the 8 TeV LHC with just the basic cuts.
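The normalization and the significance measure used throughout can be written down in a few lines. The sketch below uses only the cross sections and luminosity quoted above; the function names are hypothetical, and the pair counts are those produced before any branching ratio or selection efficiency is applied, so they are not the entries of Table 2.

```python
import math

LUMI_FB = 22.0                          # integrated luminosity (fb^-1)
SIGMA_STOP_PB = {300: 2.00, 250: 5.60}  # stop pair cross sections at 8 TeV (pb), from the text


def produced_stop_pairs(m_stop_gev):
    """Number of stop pairs produced at 22 fb^-1, before any decay or selection."""
    return SIGMA_STOP_PB[m_stop_gev] * LUMI_FB * 1000.0  # 1 fb^-1 = 1000 pb^-1


def significance(s, b):
    """Simple significance estimate s / sqrt(b) for s signal and b background events."""
    if b <= 0:
        raise ValueError("background count must be positive")
    return s / math.sqrt(b)
```

For example, `produced_stop_pairs(300)` gives about 44,000 pairs and `produced_stop_pairs(250)` about 123,200 pairs; only a small fraction survives the di-leptonic branching ratio and the basic cuts.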

3 New Kinematic Variables 

In this section, we discuss some traditional and new kinematic variables which may help with the stop search in the chargino decay channel, and examine the distributions of the signal and $t\bar t$ background events in these variables. Because of the diversity of the signal spectra, we do not anticipate that any single variable can cover all spectra. Therefore, we explore as many variables as possible and identify which set of variables will work best for each spectrum. We categorize the variables that we consider into three different classes: basic variables, $M_{T2}$-based variables, and new "compatible-masses" variables which can be viewed as generalizations of $M_{T2}$ variables but using all available kinematic constraints. We will describe each class in sequence. There are also variables which depend on the different spins of the top and the stop. A spin-correlation variable, the azimuthal angle between the two leptons, is also investigated at the end of this section.

Table 2: The signal and background events after the basic cuts at the 8 TeV LHC. The second column corresponds to the numbers of signal and background events normalized to the 22 fb$^{-1}$ luminosity. The signal significances are shown in the last column. The signal spectra S1-S6 have been described in Table 1.

3.1 Basic Variables 

As we saw in Table 1, different spectra have different $p_T$'s for the $b$-jets and charged leptons. The $p_T$'s of visible particles can certainly be used to reduce the $t\bar t$ background. The missing transverse energy in a signal event comes from both the neutralinos and neutrinos, while in a background event it only comes from the neutrinos. Consequently, we may expect some differences in $E_T^{\rm miss}$ between the signals and the background. We show the $E_T^{\rm miss}$ distributions for the $t\bar t$ background and signals for all six spectra in Fig. 2, where the horizontal axis starts from zero $E_T^{\rm miss}$. From Fig. 2, we can see that for S1, S2 and S3, the signal events turn out to have harder distributions of $E_T^{\rm miss}$ than the background, and $E_T^{\rm miss}$ could be useful to improve the search. For S4, S5 and S6, with a smaller mass difference $m_{\tilde t} - m_{\tilde\chi^0}$, the difference is much smaller, so cutting on $E_T^{\rm miss}$ is not likely to improve the signal significance.

Figure 2: $E_T^{\rm miss}$ distributions of the six different signals and the $t\bar t$ background.

Another commonly used kinematic variable is the so-called effective mass variable, defined as

$$M_{\rm eff} = E_T^{\rm miss} + \sum_{i=b_1,b_2,\ell_1,\ell_2} p_T^{i} , \qquad (1)$$

which can capture the generic center-of-mass energy of each event and is usually useful for searching for new physics with heavy particles. We show the signal and background distributions in $M_{\rm eff}$ in Fig. 3. We see that S1, S2, and S3 have somewhat harder distributions than the background. On the other hand, S4, S5, and S6 on average have smaller $M_{\rm eff}$ than the background due to the small splitting between the stop and neutralino masses.

Figure 3: $M_{\rm eff}$ distributions of the six different signals and the $t\bar t$ background.
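A minimal sketch of the effective mass computation is given below. It assumes the common convention in which $M_{\rm eff}$ is the scalar sum of $E_T^{\rm miss}$ and the transverse momenta of the two $b$-jets and two leptons; that convention, the function name and the input format (transverse momentum components in GeV) are assumptions made for illustration.

```python
import math


def effective_mass(met, visible_pts):
    """Scalar sum of missing transverse energy and visible-object transverse momenta.

    met         -- magnitude of the missing transverse momentum (GeV)
    visible_pts -- iterable of (px, py) pairs for b1, b2, l1, l2 (GeV)
    """
    return met + sum(math.hypot(px, py) for px, py in visible_pts)
```

Because every term is a positive magnitude, $M_{\rm eff}$ tracks the overall energy scale of the event rather than any single mass difference, which is consistent with its limited discriminating power for the compressed spectra S4-S6.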

3.2 $M_{T2}$-based Variables



A second class of variables is based on the $M_{T2}$ variable discussed in the literature [49,50,56,57]. $M_{T2}$ uses the visible particle momenta and $\mathbf{p}_T^{\rm miss}$ to find an optimized transverse mass for both decay chains. Specifically for the final state of our interest, one can define the $M_{T2}^{\ell}$ variable using the two lepton momenta and $\mathbf{p}_T^{\rm miss}$ as

$$M_{T2}^{\ell} = \min_{\mathbf{p}_1 + \mathbf{p}_2 = \mathbf{p}_T^{\rm miss}} \left[ \max \left\{ m_T(\mathbf{p}_T^{\ell_1}, \mathbf{p}_1),\; m_T(\mathbf{p}_T^{\ell_2}, \mathbf{p}_2) \right\} \right] , \qquad (2)$$

where the mass of the missing particle is set to zero, as it would be in a $t\bar t$ background event. The signal and background event distributions are shown in Fig. 4. Comparing the signal and background behaviors in terms of $M_{T2}^{\ell}$, we can see that S1, S4 and S5 have smaller values. This is essentially due to their softer lepton momenta. As a result, imposing a lower limit cut on $M_{T2}^{\ell}$, like what has been done in the ATLAS search [44], can only decrease the signal significance. On the other hand, the spectrum S3 does have a harder lepton, and requiring $M_{T2}^{\ell}$ to be above a certain value can increase the signal significance. This is indeed the case. We found that after imposing $M_{T2}^{\ell} > 85$ GeV, the signal significance $s/\sqrt{b}$ of S3 increases from 0.88 to 2.94. This increase is very significant and arises because the $t\bar t$ background has $M_{T2}^{\ell}$ bounded from above by the $W$ gauge boson mass (denoted by the vertical black lines in Fig. 4).

Figure 4: $M_{T2}^{\ell}$ distributions for the six signals and the $t\bar t$ background (B). The vertical black line indicates the $W$ gauge boson mass. The vertical purple line shows the suggested lower cut to increase the S3 signal significance. As shown in Ref. [58], $M_{T2}^{\ell}$ will vanish if $\mathbf{p}_T^{\rm miss}$ lies in between the two vectors $\mathbf{p}_T^{\ell_1}$ and $\mathbf{p}_T^{\ell_2}$ in the transverse plane. This explains the accumulation of events at zero.
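To make the min-max definition of the leptonic $M_{T2}$ concrete, the sketch below evaluates it by brute force for massless visible and invisible particles. This is not the algorithm used in the paper (dedicated, much faster $M_{T2}$ calculators exist); it relies on the fact that, in the massless case, the squared transverse mass $2(|\mathbf{p}^{\rm vis}||\mathbf{q}| - \mathbf{p}^{\rm vis}\cdot\mathbf{q})$ is a convex function of the trial invisible momentum $\mathbf{q}$, so a nested ternary search finds the global minimum. All names and the size of the search box are illustrative choices.

```python
import math


def mt_sq(vis, inv):
    """Squared transverse mass of a massless visible and a massless invisible particle."""
    dot = vis[0] * inv[0] + vis[1] * inv[1]
    return max(2.0 * (math.hypot(*vis) * math.hypot(*inv) - dot), 0.0)


def ternary_min(f, lo, hi, iters=80):
    """Minimum value of a convex 1D function f on [lo, hi], found by ternary search."""
    for _ in range(iters):
        m1 = lo + (hi - lo) / 3.0
        m2 = hi - (hi - lo) / 3.0
        if f(m1) < f(m2):
            hi = m2
        else:
            lo = m1
    return f(0.5 * (lo + hi))


def mt2_massless(vis1, vis2, pmiss):
    """Brute-force M_T2 from transverse 2-vectors (GeV), all particles massless."""
    # Search box: a generous multiple of the event's momentum scale (ad hoc choice).
    r = 5.0 * (math.hypot(*vis1) + math.hypot(*vis2) + math.hypot(*pmiss)) + 1.0

    def objective(qx, qy):
        q2 = (pmiss[0] - qx, pmiss[1] - qy)  # enforce p1 + p2 = pT^miss
        return max(mt_sq(vis1, (qx, qy)), mt_sq(vis2, q2))

    def best_over_qy(qx):
        return ternary_min(lambda qy: objective(qx, qy), -r, r)

    return math.sqrt(ternary_min(best_over_qy, -r, r))
```

Two simple checks illustrate the behavior. With leptons along the $x$ and $y$ axes and the missing momentum pointing between them, for example `mt2_massless((30, 0), (0, 30), (20, 20))`, the result is numerically zero, which is the accumulation at zero seen in the distributions. With the missing momentum balancing the two leptons, `mt2_massless((30, 0), (0, 30), (-30, -30))`, the result agrees with the known closed form $\sqrt{2(|\mathbf{p}_1||\mathbf{p}_2| + \mathbf{p}_1\cdot\mathbf{p}_2)} = \sqrt{1800}$ GeV for that configuration.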

A similar variable $M_{T2}^{b}$ can be constructed by replacing $\mathbf{p}_T^{\ell_i}$ by $\mathbf{p}_T^{b_i}$ and $\mathbf{p}_T^{\rm miss}$ by $\mathbf{p}_T^{\rm miss} + \mathbf{p}_T^{\ell_1} + \mathbf{p}_T^{\ell_2}$ in Eq. (2) and taking the missing particle mass to be the $W$ gauge boson mass. We show the $M_{T2}^{b}$ distributions for signals and $t\bar t$ background in Fig. 5. For the $t\bar t$ background, the $M_{T2}^{b}$ distribution has an end-point at the top quark mass, as indicated by the vertical black line. The relative signal and background distributions follow from the hardness of the $b$-jet. Since the S1 signal has the hardest $b$-jet, its $M_{T2}^{b}$ extends to a larger value. Imposing a lower limit cut on $M_{T2}^{b}$ can therefore substantially increase the signal significance. For example, a cut of $M_{T2}^{b} > 190$ GeV, denoted by the vertical blue line in Fig. 5, can provide an improvement of $s/\sqrt{b}$ from 0.48 to 1.27.

Figure 5: $M_{T2}^{b}$ distributions for the six signals and the $t\bar t$ background (B). The vertical black line indicates the top quark mass. The vertical blue line shows the suggested lower cut to increase the S1 signal significance.

One additional variable, $M_{T2}^{b\ell}$, can be defined by treating the sum of one $b$-jet momentum and one charged-lepton momentum as a single effective visible particle and the missing particle in each decay chain as massless. There is a combinatorial ambiguity for this variable in how to choose between the two possible pairings. In our analysis, for each possible pairing we take the larger of the two $b$-jet plus lepton invariant masses, and we choose the pairing for which this larger invariant mass is smaller. This choice is based on the fact that for the correct combination the $b$-jet and the lepton come from the decay of the same mother particle, so their invariant mass should be limited by the mother particle mass. The signal and background distributions are shown in Fig. 6. For the S1, S2, S3 distributions, one can see that the hardness of $b$-jets, leptons and $E_T^{\rm miss}$ can make the signal $M_{T2}^{b\ell}$ distribution extend beyond that of the $t\bar t$ background. On the contrary, the S4, S5 and S6 signal distributions have the majority of their $M_{T2}^{b\ell}$ values below the top quark mass. So, to increase the signal significance for these spectra, one may want to impose an upper limit cut on $M_{T2}^{b\ell}$ instead.
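The pairing rule described above (pick the pairing whose larger $b$-jet plus lepton invariant mass is the smaller one) can be sketched as follows. Four-momenta are represented as hypothetical `(E, px, py, pz)` tuples in GeV, and the function names are illustrative.

```python
import math


def inv_mass(p, q):
    """Invariant mass of the sum of two four-momenta given as (E, px, py, pz)."""
    e, px, py, pz = (p[i] + q[i] for i in range(4))
    return math.sqrt(max(e * e - px * px - py * py - pz * pz, 0.0))


def choose_pairing(b1, b2, l1, l2):
    """Return the (b, lepton) pairing whose larger invariant mass is the smaller one."""
    pairings = [((b1, l1), (b2, l2)), ((b1, l2), (b2, l1))]
    return min(pairings, key=lambda pr: max(inv_mass(*pr[0]), inv_mass(*pr[1])))
```

The same rule can be reused wherever a $b$-jet must be assigned to a lepton, since it only depends on the visible four-momenta and not on the missing momentum.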

3.3 Compatible-masses Variables 

Each of the $M_{T2}$ variables discussed above only uses a part of the full kinematic information of each event. For the signals and background in our study, the two decay chains are symmetric. One can in principle use the three equal on-shell-mass constraints in the two-step decays at the same time to further distinguish signals from background [59]. For example, we can concentrate on the $t\bar t$ background and define variables to fully utilize the top quark, $W$ gauge boson, and neutrino mass constraints. A simple counting of the number of unknowns and the number of equations indicates that the unknown neutrino momenta can be obtained by solving the coupled equations up to a four-fold ambiguity [60]. As $M_{T2}$ can be interpreted as the minimal mother particle mass compatible with the kinematics of an event for an assumed event topology and daughter particle mass [56], we can consider the two mass-squared differences in the two-step decays that are compatible with all visible particle momenta and the equal-on-shell-mass constraints. They define an allowed region in the two-dimensional parameter space for each event. There is more than one way to extract useful variables from such distributions. We choose to define $\Delta_1$ and $\Delta_2$ as the compatible mass-squared differences which minimize their sum (i.e., the mass-squared difference between the first particle and the last particle in the two-step decay chain). Similar to the $M_{T2}^{b\ell}$ variable, there is a combinatorial issue for this variable in choosing between the two possible pairings of $b$-jet and lepton momenta. In our analysis, we choose the pairing with the smaller value of the larger invariant mass of the $b$-jet momentum and the lepton momentum, as in the case of $M_{T2}^{b\ell}$. The detailed definition and computation of these two variables are described in Appendix A. For the $t\bar t$ background, $\Delta_1$ and $\Delta_2$ provide an estimate of $M_W^2 - m_\nu^2$ and $m_t^2 - M_W^2$, respectively. It turns out that, even after the detector smearing and the ambiguity of the event reconstruction, $\Delta_1$ and $\Delta_2$ have a scattered distribution near the true mass-squared differences.

Figure 6: $M_{T2}^{b\ell}$ distributions for the six signals and the $t\bar t$ background (B). The vertical black line indicates the top quark mass.

For the signal events, $\Delta_2$ will be close to $m_{\tilde t}^2 - m_{\tilde\chi^\pm}^2$, while $\Delta_1$ is closely related to, but does not have a simple formula in terms of, $m_{\tilde\chi^\pm}$ and $m_{\tilde\chi^0}$ because of the two different sources contributing to the total missing transverse energy. As a result, the signal distributions are expected to be more scattered than the background. We show the scatter plots of $\sqrt{\Delta_1}$ and $\sqrt{\Delta_2}$ for the six different signals vs. background in Fig. 7. Looking at these scatter plots, we can already see that S1 and S6 have their points centered at locations dramatically different from the $t\bar t$ background. For the signal S1, we found that imposing a cut of $\sqrt{\Delta_1} < 70$ GeV and $\sqrt{\Delta_2} > 180$ GeV (indicated by the blue vertical and horizontal lines) can increase the significance from 0.48 to 1.32. For the signal S6, imposing a cut $\sqrt{\Delta_2} < 100$ GeV can increase the significance from 1.19 to 2.07.
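For orientation, the values around which the $t\bar t$ background is expected to cluster in the compatible-masses plane follow directly from the mass-squared differences. The short sketch below evaluates them, assuming $m_t = 173$ GeV, $M_W = 80.4$ GeV and a massless neutrino (standard values that are not quoted in the text); the function name is hypothetical. It is only meant for the background, since for the signal the first difference has no simple closed form.

```python
import math

M_TOP = 173.0  # ASSUMPTION: top quark mass (GeV)
M_W = 80.4     # ASSUMPTION: W boson mass (GeV)
M_NU = 0.0     # neutrino mass, negligible


def sqrt_deltas(m_mother, m_mid, m_daughter):
    """Square roots of the two mass-squared differences of a two-step decay chain."""
    delta_1 = m_mid ** 2 - m_daughter ** 2  # second step: intermediate -> invisible daughter
    delta_2 = m_mother ** 2 - m_mid ** 2    # first step: mother -> intermediate
    return math.sqrt(delta_1), math.sqrt(delta_2)


if __name__ == "__main__":
    d1, d2 = sqrt_deltas(M_TOP, M_W, M_NU)
    print(f"ttbar: sqrt(Delta_1) ~ {d1:.1f} GeV, sqrt(Delta_2) ~ {d2:.1f} GeV")
```

With these inputs the background sits near $(\sqrt{\Delta_1}, \sqrt{\Delta_2}) \approx (80, 153)$ GeV, which is why the cuts quoted above for S1 ($\sqrt{\Delta_1} < 70$ GeV, $\sqrt{\Delta_2} > 180$ GeV) and S6 ($\sqrt{\Delta_2} < 100$ GeV) select regions away from the bulk of the $t\bar t$ events.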
Figure 7: Scatter plots for S1-3 and the $t\bar t$ background (B) in the ($\sqrt{\Delta_1}$, $\sqrt{\Delta_2}$) plane. 500 points are shown for both the signal and the background.

3.4 A Spin Correlation Variable



In addition to the spectrum difference, one may also explore possible kinematic effects due to the different spins of the top and the stop to distinguish signals from the background. Such effects can arise from both production and decays. Top quarks are generally produced with larger rapidity gaps than the stops, and Ref. [26] has exploited this as a possible discriminator of top and stop events. However, it suffers from large systematic uncertainties and the result was not particularly promising. Another effect that might be useful is the spin correlation. The top quark has spin 1/2 and the top-anti-top pair is produced with correlated polarizations. The leptons in the final states inherit the spin correlation from the top quarks and can be used to construct variables sensitive to this effect [25,26,36]. On the other hand, stops are scalars and the decays from the two stops are not correlated at all. In Ref. [26], the azimuthal angle difference between the two leptons, $\Delta\phi_{\ell^+\ell^-}$, has been found to be useful for the case with the stop decaying to the top quark and neutralino. We apply this variable to our case with the stop decaying to the bottom quark and chargino and plot the distributions of $\Delta\phi_{\ell^+\ell^-}$ for the six different signals and the $t\bar t$ background in Fig. 8. For all signals and background, the $\Delta\phi_{\ell^+\ell^-}$ distributions tend to peak at $\pi$, indicating that the two leptons prefer to move in opposite directions. Although some signals have slightly sharper distributions than the background, we have checked that cutting on $\Delta\phi_{\ell^+\ell^-}$ does not improve the signal significance even when combined with other variables. Therefore, we will not include this spin-correlation variable in the subsequent studies in Sec. 4 when we consider combinations of variables.

Figure 8: $\Delta\phi_{\ell^+\ell^-}$ distributions for the six different signals and the $t\bar t$ background. $\Delta\phi_{\ell^+\ell^-}$ is the azimuthal angle difference of the two leptons in the final state.
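
The azimuthal angle difference of the two leptons is straightforward to compute from their transverse momentum components. The sketch below is illustrative; the function name and the `(px, py)` input format are assumptions.

```python
import math


def delta_phi_leptons(lep_plus, lep_minus):
    """Azimuthal angle difference in [0, pi] between two leptons given as (px, py)."""
    phi_plus = math.atan2(lep_plus[1], lep_plus[0])
    phi_minus = math.atan2(lep_minus[1], lep_minus[0])
    dphi = abs(phi_plus - phi_minus)
    # Wrap around so that the result is the smaller of the two possible angles.
    return 2.0 * math.pi - dphi if dphi > math.pi else dphi
```

Back-to-back leptons give a value of $\pi$, the region where all of the distributions discussed above tend to peak.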

4 Variable Performance and Improvements on Different Spectra

Having described the individual variables, we now study their performance. For some spectra there is no single variable that works well by itself, so we also combine several variables together and study the improvement on the stop search. Instead of searching for the best cut by hand, we use the Boosted Decision Tree (BDT) method [61] to identify the best variable or the best combination of variables for a given spectrum. Starting from the signal significance after basic cuts in Table 2, we show the changes of the signal significance $s/\sqrt{b}$ as a function of the signal efficiency for different combinations of variables in Figs. 9 and 10. We have defined different combinations of variables as

• combo-all: the combination of all variables,

• combo-$\Delta$: the combination of $E_T^{\rm miss}$, $M_{\rm eff}$ and $\Delta$ variables,

• combo-$M_{T2}$: the combination of $E_T^{\rm miss}$, $M_{\rm eff}$ and $M_{T2}$ variables,

• combo-$p_T$: the combination of $E_T^{\rm miss}$, $M_{\rm eff}$ and $p_T$ variables,

• $E_T^{\rm miss}$ & $M_{\rm eff}$: the combination of only $E_T^{\rm miss}$ and $M_{\rm eff}$.

In the region with the signal efficiency $\epsilon_s > 0.1$ and the background efficiency $\epsilon_b > 0.01$, where the statistics of events can be trusted, combo-all always provides the best $s/\sqrt{b}$. The right vertical label of these plots in Figs. 9 and 10 is $\epsilon_s/\sqrt{\epsilon_b}$, which directly indicates how much we improve the signal significance from the basic selection of Sec. 2. For S1, S2 and S3, the largest value of $s/\sqrt{b}$ occurs at a small signal efficiency of around 0.1-0.2, which suggests that the optimized cut prefers to reduce both signal and background events. On the contrary, for S4, S5 and S6, the largest value of $s/\sqrt{b}$ occurs at a fairly large signal efficiency, which suggests that the best cuts do not occur at the tail of the distributions.
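The relation between the efficiencies of a cut and the change in significance can be made explicit: if a cut keeps a fraction $\epsilon_s$ of the signal and $\epsilon_b$ of the background, then $s/\sqrt{b}$ is multiplied by $\epsilon_s/\sqrt{\epsilon_b}$. The sketch below encodes this together with the statistics requirement quoted above; the function names are hypothetical.

```python
import math


def improvement_factor(eff_s, eff_b):
    """Factor by which s/sqrt(b) changes after a cut with efficiencies eff_s and eff_b."""
    if eff_b <= 0:
        raise ValueError("background efficiency must be positive")
    return eff_s / math.sqrt(eff_b)


def passes_statistics_floor(eff_s, eff_b, min_eff_s=0.1, min_eff_b=0.01):
    """True if the working point lies in the region where the simulated statistics are trusted."""
    return eff_s > min_eff_s and eff_b > min_eff_b
```

For instance, a working point with $\epsilon_s = 0.2$ and $\epsilon_b = 0.01$ doubles the significance, while a cut that removes signal and background in equal proportion ($\epsilon_s = \epsilon_b < 1$) always lowers it.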

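To identify which variable is the best one and how it behaves in each group, we also show the best variable in each of the $\Delta$, $M_{T2}$ and $p_T$ groups, which consist of

• for the $\Delta$ group: $\Delta_1$, $\Delta_2$, $\sqrt{\Delta_1 + \Delta_2}$ and $\tan^{-1}[\Delta_2/\Delta_1]$,

• for the $M_{T2}$ group: $M_{T2}^{\ell}$, $M_{T2}^{b}$ and $M_{T2}^{b\ell}$,

• for the $p_T$ group: the individual $p_T$'s, the sums $p_T^{\ell} = p_T^{\ell_1} + p_T^{\ell_2}$ and $p_T^{b} = p_T^{b_1} + p_T^{b_2}$, and the ratio $p_T^{\rm ratio}$ of these two sums.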

For the moderately compressed spectra considered in this paper, there is no single variable that can reach the same level of improvement as the combination of all variables does. Our results suggest that the best strategy for the experimental analysis in this more difficult region of the model parameter space should at least combine several kinematic variables to obtain the maximal exclusion reach.

Figure 9: The signal significance as a function of the signal efficiency $\epsilon_s$ after the Boosted Decision Tree optimization for S1-3. The right vertical label is the efficiency ratio $\epsilon_s/\sqrt{\epsilon_b}$, indicating the relative improvements on top of the basic cuts.

Table 3: $s/\sqrt{b}$ and $s/b$ after the optimized cuts from BDT at 22 fb$^{-1}$, with the requirement that the signal and the background efficiencies be $\epsilon_s > 0.1$ and $\epsilon_b > 0.01$.

Table 4: The same as Table 3, but requiring $\epsilon_s > 0.1$ and $\epsilon_b > 0.005$.

For S1, the variable $M_{T2}^{b}$ turns out to be the best single variable, as we expected. The combination of $\Delta$ variables and the combination of $M_{T2}$ variables are both quite useful. The combination of all variables is even better, indicating that the correlations between the $\Delta$ variables and the $M_{T2}$ variables are not too strong. As far as the performance of a single variable is concerned, $M_{T2}^{b}$ can be used to improve the search, although its increase in $s/\sqrt{b}$ is mild, as can be seen from the dotted orange curve of the first plot in Fig. 9. The improvement from a combination of all variables is very dramatic and can increase $s/\sqrt{b}$ by more than a factor of five. To make sure that the improvement numbers are statistically reliable, we require $\epsilon_s > 0.1$ and $\epsilon_b > 0.01$ (0.005), which means that there are more than 100 (50) background events left, and show the signal significance of the optimized cuts in Table 3 (4).

For S2, both $b$-jet and lepton momenta from the signal are comparable to the background. In the region where we trust the statistics of our simulated events, there is no single variable that can substantially increase the significance. Combo-$M_{T2}$ shows a peak structure in the low signal efficiency region, which is mainly due to one of the $M_{T2}$ variables. Combo-$\Delta$ is not that useful and only increases the significance by around 20%. Again the combination of all variables does show a moderate improvement and can increase the significance by a factor of 1.74 from Table 3 (4).

For S3, the story is very simple. The charged lepton momenta from signal events are generally
harder than those of the background. Although the basic kinematic variables do not improve the signal
significance, the $M_{T2}^{\ell}$ variable is seen to increase $s/\sqrt{b}$ dramatically. The improvement curve from the
combination of all variables follows the curve from $M_{T2}^{\ell}$ alone. The enhancement factor on $s/\sqrt{b}$ can
reach 2.0 (2.5) from Table 3 (4).

For S4, the previously discussed combinations of variables, except the combination of all variables,
can hardly improve the signal significance at all. However, the combination of all variables does
show an impressive improvement, with a factor as large as 1.87 (see the tables of BDT results). To identify the
subset of variables relevant for such an improvement, we have also tried other combinations and found
that $p_T^{\ell} + M_{T2}^{\ell}$ can do almost as well as the combination of all variables. We show the performance of
$p_T^{\ell} + M_{T2}^{\ell}$ by the green solid curve in Fig. 10, which almost matches the performance of the combination
of all variables. Furthermore, we have found that the combination of $E_T^{\rm miss}$, $M_{\rm eff}$, $p_T^{\ell}$, $M_{T2}^{\ell}$ and $\Delta_1$
comes very close to the best performance. The brown solid curve in Fig. 10 shows the performance of
this combination, labeled as "combo-mix." To further understand the behaviors of $p_T^{\ell}$ and $M_{T2}^{\ell}$ for S4,
we show the $p_T^{\ell}$ distribution in the left panel of Fig. 11 and the scatter plot of $p_T^{\ell}$ and $M_{T2}^{\ell}$
in the right panel. We can see from the scatter plot that the S4 signal events are concentrated at
smaller values of $p_T^{\ell}$ and $M_{T2}^{\ell}$. A cut of $M_{T2}^{\ell} < 30$ GeV and $p_T^{\ell} < 75$ GeV (indicated by the two red
lines in the right panel of Fig. 11) increases $s/\sqrt{b}$ from 0.77 to 1.28.

Figure 11: Left: $p_T^{\ell}$ (a scalar sum of the $p_T$'s of the two leptons) distributions for S4-6 and the $t\bar{t}$
background (B). Right: the scatter plot for S4 and B in the ($M_{T2}^{\ell}$, $p_T^{\ell}$) plane.

For the spectrum S5, no variable provides a substantial improvement of the significance.
Combo-$M_{T2}$ provides a better significance than the other combinations, and can do almost as well
as the combination of all variables. The optimized cuts can increase $s/\sqrt{b}$ by a factor of 1.36 and yield
a projected signal significance of 1.65. As a result, this spectrum could be excluded at 95% C.L. at
8 TeV with a luminosity of 22 fb$^{-1}$.
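
The link between a significance of 1.65 and a 95% C.L. exclusion can be checked with the Gaussian cumulative distribution. This is a minimal sketch assuming a Gaussian test statistic and a one-sided limit, a common convention that the text does not spell out. The function name is hypothetical.

```python
import math

def one_sided_confidence_level(z):
    """Gaussian cumulative probability for a significance of z standard deviations."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

# A significance of 1.65 sits just above the one-sided 95% threshold.
cl = one_sided_confidence_level(1.65)
print(f"one-sided C.L. for z = 1.65: {cl:.3f}")
```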

For the last spectrum S6, the single variable $\Delta_2$ is better than any other single variable. Combo-$\Delta$
and combo-$M_{T2}$ have similar performances and either one can provide a significant improvement. The
combination of all variables can further improve the signal significance by around 20%, giving a
total improvement factor of 2.19. So, for this type of spectra, the new $\Delta$ variables proposed in this paper
(discussed in detail in Appendix A) are highly recommended for real experimental analyses.

For the spectra S4-S6, reducing the $t\bar{t}$ background in general requires upper limit cuts on the
variables that we considered, because the signal events are softer. One might be concerned about other
types of backgrounds not included in this study, which may also be soft. Most of them
have been heavily suppressed by our basic cuts. In addition, the best improvements for S4-S6 occur at
relatively high signal efficiencies, where the other small backgrounds should not make a large difference.

5 Conclusions 

In this paper, we study the stop search in the chargino decay channel with direct stop production.
We focus on the challenging scenario where the spectrum of the superpartners involved in the decay
is moderately compressed. The overall transverse momentum of the visible particles (2 leptons +
2 b-jets) is similar to or somewhat smaller than that of the $t\bar{t}$ background, making a signal event difficult to
distinguish from a $t\bar{t}$ event. However, depending on the intermediate chargino mass, the distribution
of the individual lepton or b-jet momentum can behave differently from that of the $t\bar{t}$ background. We
studied many kinematic variables, including simple traditional variables such as $E_T^{\rm miss}$, $M_{\rm eff}$ and
individual particle $p_T$'s, the $M_{T2}$ variables, and the new compatible-masses variables ($\Delta_{1,2}$), which
use all on-shell conditions of the $t\bar{t}$ event topology. We found that different variables are useful for
different spectra. As a summary, we list the signal spectra, the characteristics of their b-jet and lepton
momenta, and the best variable(s) for each signal spectrum in Table 5.

A general conclusion from Table 5 is that different variables should be used for different spectra.
Specifically, if either the b-jets or the leptons of the signal are harder than the corresponding ones of the $t\bar{t}$
background, a single $M_{T2}$ variable can improve the stop search significantly. For example, one could
use $M_{T2}^{b}$ for S1 and $M_{T2}^{\ell}$ for S3. For some spectra like S4, where the leptons from the signal are softer than
those of the background, the variables $p_T^{\ell}$ or $M_{T2}^{\ell}$ can still improve the search if one imposes an
upper limit cut on these two variables. Similarly for S6, the b-jets from the signal are softer, and imposing
an upper limit cut on the variable $\Delta_2$ can improve the search. For S2, both b-jets and leptons are
comparable to those of the background and no single variable works well. For S5, although
both b-jets and leptons are softer than those of the background, the differences are relatively small and
again no single variable works well. However, a combination of many variables can still give
some improvement for the S2 and S5 spectra.

Table 5: A summary of the best variables for the six different spectra.

In our study, we have assumed that all visible particles can pass the basic cuts and be captured
for a significant fraction of the signal events. This may no longer be true if the splitting between any two
superpartners in the decay chain becomes very small, since the visible particle from the corresponding
decay may then be too soft to be detected. In that case, one can only rely on the visible particles from the
other step of the decay (e.g., b-jets if the chargino and neutralino are too degenerate, or leptons if the
stop and chargino are too degenerate). The $M_{T2}$ variables might still be useful if the distributions
of the signal and background are sufficiently different. In the limit that all three superpartners are
degenerate, few visible particles are hard enough to pass the basic cuts, and one has to revert to the
mono-jet or mono-photon search with initial state radiation.

For a final remark, we would like to emphasize that the kinematic variables studied in this paper
can also be applied to other new physics searches as long as the dominant background is $t\bar{t}$ in the
dileptonic channel. For example, one can apply them to the $t'$ search with $t' \to b + W$ and the $W$ decaying
leptonically.

Figure 12: Left: Two-step symmetric decay chains. The diagram also includes the notations that
we use. Right: A typical allowed region (blue) in the ($\sqrt{\Delta_1}$, $\sqrt{\Delta_2}$) plane, where $m_N$ is set to zero,
[...] $m_Y^2$. The red curve corresponds to a constant value of $m_Y^2 = \Delta_1 + \Delta_2$.
We scan over the allowed region to find the point that gives the minimum $m_Y$. For a fixed $m_N$, a
constant $m_Y$ corresponds to a constant $\Delta_1 + \Delta_2$, which is a quarter-circle in the ($\sqrt{\Delta_1}$, $\sqrt{\Delta_2}$) plane.
When the quarter-circle is tangent to the compatible region (as shown by the red curve), the point of
intersection gives the minimal compatible $m_Y$.

A Minimal Compatible Masses for Two-step Symmetric Decay Chain

In this Appendix we give a detailed definition of the "compatible-masses" variables $\Delta_1$ and $\Delta_2$ and
describe how to calculate them. We consider pair production of some particle $Y$, each of which
goes through a two-step decay. It first decays to an intermediate particle $X$ plus a visible particle, and
then $X$ decays to an invisible particle $N$ and another visible particle. The process is shown in Fig. 12,
which also exhibits how we label the particles. The masses of $Y$, $X$, $N$ are denoted as $m_Y$, $m_X$, and
$m_N$, and will be treated as unknowns. Particles 3, 4, 5, 6 are visible and their 4-momenta can be
experimentally measured. For a set of test values of $(m_Y, m_X, m_N)$, we can solve for the 4-momenta
of the missing particles 1 and 2 when $Y$, $X$, $N$ are on-shell. To see this, let us first focus on one decay
chain (e.g., the chain with particles 1, 3, and 5). The mass shell conditions are
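
$$ p_1^2 = m_N^2, \tag{3} $$
$$ (p_1 + p_3)^2 = m_X^2, \tag{4} $$
$$ (p_1 + p_3 + p_5)^2 = m_Y^2. \tag{5} $$

Taking the differences between the mass shell equations we have

$$ 2\, p_1 \cdot p_3 = \Delta_1 - p_3^2 \equiv \Delta_1', \tag{6} $$
$$ 2\, p_1 \cdot p_5 = \Delta_2 - 2\, p_3 \cdot p_5 - p_5^2 \equiv \Delta_2', \tag{7} $$

where $\Delta_1 \equiv m_X^2 - m_N^2$ and $\Delta_2 \equiv m_Y^2 - m_X^2$, and we further define $\Delta_1'$ and $\Delta_2'$ to simplify the expressions.

Expanding the 4-momenta, the equations can be written explicitly as

$$ E_1 E_3 - p_{1x} p_{3x} - p_{1y} p_{3y} - p_{1z} p_{3z} = \frac{\Delta_1'}{2}, \tag{8} $$
$$ E_1 E_5 - p_{1x} p_{5x} - p_{1y} p_{5y} - p_{1z} p_{5z} = \frac{\Delta_2'}{2}. \tag{9} $$

Combining these two equations we can eliminate $E_1$ and express $p_{1z}$ in terms of $p_{1x}$ and $p_{1y}$,

$$ p_{1z} = A\, p_{1x} + B\, p_{1y} + C, \tag{10} $$

where

$$ A = \frac{E_3 p_{5x} - E_5 p_{3x}}{E_5 p_{3z} - E_3 p_{5z}}, \qquad
   B = \frac{E_3 p_{5y} - E_5 p_{3y}}{E_5 p_{3z} - E_3 p_{5z}}, \qquad
   C = \frac{E_3 \Delta_2' - E_5 \Delta_1'}{2\,(E_5 p_{3z} - E_3 p_{5z})}. \tag{11} $$

Substituting $E_1^2 = m_N^2 + p_{1x}^2 + p_{1y}^2 + p_{1z}^2$ and Eq. (10) back into Eq. (9), we obtain a quadratic
equation in $p_{1x}$ and $p_{1y}$,

$$ a\, p_{1x}^2 + 2b\, p_{1x} p_{1y} + c\, p_{1y}^2 + 2d\, p_{1x} + 2f\, p_{1y} + g = 0, \tag{12} $$

where

$$ \begin{aligned}
a &= E_5^2 (1 + A^2) - (p_{5x} + p_{5z} A)^2, \\
b &= E_5^2 A B - (p_{5x} + p_{5z} A)(p_{5y} + p_{5z} B), \\
c &= E_5^2 (1 + B^2) - (p_{5y} + p_{5z} B)^2, \\
d &= E_5^2 A C - (p_{5x} + p_{5z} A)\left(p_{5z} C + \frac{\Delta_2'}{2}\right), \\
f &= E_5^2 B C - (p_{5y} + p_{5z} B)\left(p_{5z} C + \frac{\Delta_2'}{2}\right), \\
g &= E_5^2 (m_N^2 + C^2) - \left(p_{5z} C + \frac{\Delta_2'}{2}\right)^2,
\end{aligned} \tag{13} $$

and $A$, $B$ and $C$ are defined in Eq. (11). The quadratic equation (12) describes an ellipse in the
$(p_{1x}, p_{1y})$ plane.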
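
The construction of the quadratic equation for one decay chain can be sketched numerically. The code below eliminates $E_1$ between the two on-shell differences, writes $p_{1z} = A\,p_{1x} + B\,p_{1y} + C$, and builds the conic coefficients. It then checks that a known invisible momentum satisfies the resulting equation. It assumes the convention $\Delta_1' = 2\,p_1 \cdot p_3$ and $\Delta_2' = 2\,p_1 \cdot p_5$, which is an assumption about the index labeling. All function names and the sample momenta are hypothetical and are not taken from the authors' public code.

```python
import math

def four_vector(px, py, pz, mass=0.0):
    """Return (E, px, py, pz) for a particle of the given mass."""
    return (math.sqrt(mass * mass + px * px + py * py + pz * pz), px, py, pz)

def minkowski_dot(p, q):
    """Lorentz-invariant product with signature (+, -, -, -)."""
    return p[0] * q[0] - p[1] * q[1] - p[2] * q[2] - p[3] * q[3]

def chain_conic(p3, p5, delta1_p, delta2_p, m_n=0.0):
    """Coefficients of a*x^2 + 2b*x*y + c*y^2 + 2d*x + 2f*y + g = 0 for one chain.

    x, y are the transverse momentum components of the invisible particle.
    Also returns (A, B, C) such that p1z = A*x + B*y + C.
    """
    e3, p3x, p3y, p3z = p3
    e5, p5x, p5y, p5z = p5
    den = e5 * p3z - e3 * p5z
    if den == 0.0:
        raise ZeroDivisionError("degenerate kinematics: cannot solve for p1z")
    big_a = (e3 * p5x - e5 * p3x) / den
    big_b = (e3 * p5y - e5 * p3y) / den
    big_c = (e3 * delta2_p - e5 * delta1_p) / (2.0 * den)
    # After substituting p1z, the second condition reads E1*E5 = kx*x + ky*y + k0.
    kx = p5x + p5z * big_a
    ky = p5y + p5z * big_b
    k0 = p5z * big_c + 0.5 * delta2_p
    e5sq = e5 * e5
    a = e5sq * (1.0 + big_a * big_a) - kx * kx
    b = e5sq * big_a * big_b - kx * ky
    c = e5sq * (1.0 + big_b * big_b) - ky * ky
    d = e5sq * big_a * big_c - kx * k0
    f = e5sq * big_b * big_c - ky * k0
    g = e5sq * (m_n * m_n + big_c * big_c) - k0 * k0
    return (a, b, c, d, f, g), (big_a, big_b, big_c)

def conic_value(coeffs, x, y):
    """Left-hand side of the conic equation at the point (x, y)."""
    a, b, c, d, f, g = coeffs
    return a * x * x + 2 * b * x * y + c * y * y + 2 * d * x + 2 * f * y + g

# Closure check with made-up momenta (GeV): the true p1 must lie on the conic.
p1 = four_vector(10.0, -20.0, 30.0)            # massless invisible particle
p3 = four_vector(25.0, 5.0, -12.0)             # massless lepton
p5 = four_vector(40.0, -30.0, 20.0, mass=4.8)  # b-jet with an assumed mass
coeffs, (A, B, C) = chain_conic(p3, p5, 2 * minkowski_dot(p1, p3), 2 * minkowski_dot(p1, p5))
residual = conic_value(coeffs, p1[1], p1[2])
print("p1z from linear relation:", A * p1[1] + B * p1[2] + C)
print("conic residual at true p1:", residual)
```

In an actual scan the primed quantities would be computed from the trial masses and the visible momenta rather than from a known $p_1$. The closure check here only confirms that the algebra is self-consistent.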

We can do the same for the other chain and obtain a similar equation in terms of $p_{2x}$ and $p_{2y}$,
which also describes an ellipse in the $(p_{2x}, p_{2y})$ plane. We can then use the knowledge of the total
missing transverse momentum ($\vec{p}_T^{\,\rm miss} = \vec{p}_{1T} + \vec{p}_{2T}$) to put both ellipses on the same $(p_{1x}, p_{1y})$ plane,
where they are described by two quadratic equations:

$$ a_1 x^2 + 2 b_1 x y + c_1 y^2 + 2 d_1 x + 2 f_1 y + g_1 = 0, \tag{14} $$
$$ a_2 x^2 + 2 b_2 x y + c_2 y^2 + 2 d_2 x + 2 f_2 y + g_2 = 0, \tag{15} $$

where for notational simplicity we have written $(p_{1x}, p_{1y})$ as $(x, y)$. The coefficients are functions of $m_Y$,
$m_X$ and $m_N$ (or equivalently, $\Delta_1$, $\Delta_2$ and $m_N$). The two quadratic equations yield 4 solutions in
general, which may be complex. In order to have physical (real) solutions for the momenta, the two
ellipses must intersect.
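
The compatibility test can be illustrated with a simple numerical stand-in. The sketch below first rewrites the second conic in the $(x, y)$ variables of the first chain using $p_{2x} = p^{\rm miss}_x - x$ and $p_{2y} = p^{\rm miss}_y - y$. It then samples points on the first ellipse and looks for a sign change of the second conic. This sampling approach is a deliberately simpler and slower substitute for an exact algebraic test: it can miss tangencies and very small overlaps. The function names are hypothetical.

```python
import math

def conic_value(k, x, y):
    """Value of a*x^2 + 2b*x*y + c*y^2 + 2d*x + 2f*y + g with k = (a, b, c, d, f, g)."""
    a, b, c, d, f, g = k
    return a * x * x + 2 * b * x * y + c * y * y + 2 * d * x + 2 * f * y + g

def to_common_plane(k, mx, my):
    """Rewrite a conic given in (x2, y2) in terms of (x, y), where x2 = mx - x, y2 = my - y."""
    a, b, c, d, f, g = k
    return (a, b, c,
            -(a * mx + b * my + d),
            -(b * mx + c * my + f),
            conic_value(k, mx, my))

def ellipse_points(k, n=720):
    """Return n points on the conic if it is a real ellipse, otherwise an empty list."""
    a, b, c, d, f, g = k
    det = a * c - b * b
    if det <= 0.0:
        return []                      # not an ellipse
    x0 = (b * f - c * d) / det         # center of the conic
    y0 = (b * d - a * f) / det
    g0 = g + d * x0 + f * y0           # conic value at the center
    mean = 0.5 * (a + c)
    half = math.hypot(0.5 * (a - c), b)
    lam1, lam2 = mean + half, mean - half   # eigenvalues, same sign since det > 0
    if -g0 / lam1 <= 0.0:
        return []                      # empty or single-point conic
    r1, r2 = math.sqrt(-g0 / lam1), math.sqrt(-g0 / lam2)
    theta = 0.5 * math.atan2(2.0 * b, a - c)  # axis direction belonging to lam1
    ct, st = math.cos(theta), math.sin(theta)
    points = []
    for i in range(n):
        t = 2.0 * math.pi * i / n
        u, v = r1 * math.cos(t), r2 * math.sin(t)
        points.append((x0 + u * ct - v * st, y0 + u * st + v * ct))
    return points

def ellipses_intersect(k1, k2, n=720):
    """Approximate test: does conic k2 change sign along ellipse k1?"""
    points = ellipse_points(k1, n)
    if not points or not ellipse_points(k2, 4):
        return False
    values = [conic_value(k2, x, y) for x, y in points]
    return min(values) < 0.0 < max(values)

# Toy check with circles: a unit circle at the origin against shifted unit circles.
unit = (1.0, 0.0, 1.0, 0.0, 0.0, -1.0)
near = (1.0, 0.0, 1.0, -1.5, 0.0, 1.25)   # center (1.5, 0), overlaps the unit circle
far = (1.0, 0.0, 1.0, -3.0, 0.0, 8.0)     # center (3, 0), disjoint from the unit circle
print(ellipses_intersect(unit, near), ellipses_intersect(unit, far))
```

For a scan over many trial masses per event, an exact root-counting method is preferable, since the sampling cost grows with the number of points needed to resolve small overlaps.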

For a given event, one can check whether a set of trial masses ($\Delta_1$, $\Delta_2$, $m_N$) is compatible by testing
whether the two ellipses intersect. This can be done efficiently without solving the actual equations by
using the Sturm sequence [62], similar to the calculation of $M_{T2}$ in Ref. [56]. If we fix $m_N = 0$ (which
is a good assumption for the $t\bar{t}$ di-leptonic background since the missing particles are neutrinos), we
have two parameters $\Delta_1$ and $\Delta_2$ left and we can scan over the ($\Delta_1$, $\Delta_2$) plane to find the region compatible
with that particular event, i.e., any point outside the compatible region does not have a physical
solution for the kinematics. A typical compatible region for an event is shown in Fig. 12. There are
in principle multiple ways to extract useful information or variables from the compatible region. We
consider a simple projection to a pair of variables which are the coordinates of the point at the tip
(with minimum $\Delta_1 + \Delta_2 = m_Y^2 - m_N^2$) of the compatible region. It can be obtained by scanning
over the allowed region to find the point that gives the minimum $m_Y$. We output the coordinates
($\Delta_1$, $\Delta_2$) of that point as our variables. Equivalently, we can also use the radial coordinates $m_Y$ and
$\tan^{-1}(\Delta_2/\Delta_1)$ as the independent variables.
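
As a consistency check on why the tip of the compatible region is informative, the relation $\Delta_1 + \Delta_2 = m_Y^2 - m_N^2$ can be evaluated for an idealized $t\bar{t}$ event. This is a sketch under assumptions not stated in this Appendix: perfect momentum measurement, correct assignment of the visible particles to the two chains, exactly on-shell tops and $W$'s, and an approximate top mass of 173 GeV.

```latex
% Idealized t tbar event with m_N = 0 (assumptions listed in the lead-in).
% The true masses form one compatible point, so the minimum over the
% compatible region cannot exceed the true value.
\begin{align}
  \left.(\Delta_1 + \Delta_2)\right|_{\rm true} &= m_t^2 - m_\nu^2 \simeq (173~\mathrm{GeV})^2 , \\
  m_Y^{\min} = \sqrt{(\Delta_1 + \Delta_2)_{\min}} &\le m_t \simeq 173~\mathrm{GeV} .
\end{align}
```

In a real analysis, detector resolution, wrong pairings and off-shell effects would smear this bound.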

For the $t\bar{t}$ di-leptonic background, particles $Y$, $X$, $N$ correspond to the top, $W$-boson and neutrino.
The signal does not have the correct topology for this set of variables. Nevertheless, since it has the
same signature as the background (2 leptons + 2 b-jets + $E_T^{\rm miss}$), we can calculate these variables anyway
and obtain a set of output values ($\Delta_1$, $\Delta_2$) for each event. Even for the wrong topology, these variables
still have a strong correlation with the hardness of the b-jets and leptons.

A detailed description of the calculation of ($\Delta_1$, $\Delta_2$), including how to test whether two ellipses
intersect, and the computer codes for calculating them can be found at the following website:
https://sites.google.com/a/ucdavis.edu/mass/.